Document Identiication for Copyright Protection Using Centroid Detection
نویسندگان
چکیده
A way to discourage illicit reproduction of copyrighted or sensitive documents is to watermark each copy before distribution. A unique mark is embedded in the text whose recipient is registered. The mark can be extracted from a possibly noisy illicit copy, identifying the registered recipient. Most image marking techniques are vulnerable to binarization attack and hence not suitable for text marking. We propose a diierent approach where a text document is marked by shifting certain text lines slightly up or down or words slightly left or right from their original positions. The shifting pattern constitutes the mark and is diierent on diierent copies. In this paper we develop and evaluate a method to detect such minute shifts. We describe a marking and identiication prototype that implements the proposed method. We present preliminary experimental results which connrms the analytical prediction that centroid detection performs remarkably well on line shifts even in the presence of severe distortions introduced by printing, photocopying, scanning, and facsimile transmission.
منابع مشابه
Performance Comparison of Two Text
A text document typically consists of a collection of regular structures such as words, lines and paragraphs, a slight movement of which seems less perceptible than, say, dithering of the document image. In this paper we exploit this property to watermark formatted text documents by shifting slightly certain lines and words, in order to discourage illicit distribution. We analyze two methods fo...
متن کاملDocument identification for copyright protection using centroid detection
A way to discourage illicit reproduction of copyrighted or sensitive documents is to watermark each copy before distribution. A unique mark is embedded in the text whose recipient is registered. The mark can be extracted from a possibly noisy illicit copy, identifying the registered recipient. Most image marking techniques are vulnerable to binarization attack and, hence, not suitable for text ...
متن کاملRobust Watermarking of Still Images for Copyright Protection
Digital watermarking has been proposed as a mean to protect the copyright of multimedia data in a networked environment, since it makes possible to tightly embed a code into a digital document allowing the identiication of the data owner. In this paper a new watermarking system for digital images is presented: the method embeds a sequence of random real numbers in a selected set of DCT coeecien...
متن کاملCentroid-based summarization of multiple documents
We present a multi-document summarizer, MEAD, which generates summaries using cluster centroids produced by a topic detection and tracking system. We describe two new techniques, a centroid-based summarizer, and an evaluation scheme based on sentence utility and subsumption. We have applied this evaluation to both single and multiple document summaries. Finally, we describe two user studies tha...
متن کاملحمایت از حق مؤلف در فضای سایبر در حقوق ملی و اسناد بینالمللی
Development of information technology and entrance to digital millennium confronted Copyright system with some serious challenges so that in some cases, protection of creators of digital works and protection of artistic and literary works in digital and cyber space and performance of this works in that space is in doubt. In order to removing this concerns and protection of copyright a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007